A fast approach to psychoacoustic model compensation for robust speaker recognition in additive noise
نویسنده
چکیده
This paper addresses the problem of speaker verification in the presence of additive noise. We propose a fast implementation of Psychoacoustic Model Compensation (Psy-Comp) scheme for static features along with model domain mean and variance normalization for robust speaker recognition in noisy conditions. The proposed algorithms are validated through experiments on noise corrupted NIST-2000 speaker recognition database. We show that the Psy-Comp scheme along with model domain mean and variance normalization provide significant performance gain compared to the Vector Taylor Series (VTS) scheme and feature domain cepstral mean and variance normalization scheme. Moreover, the computational cost of the proposed method is significantly less than the VTS scheme.
منابع مشابه
Predictive model-based compensation schemes for robust speech recognition
For practical applications speech recognition systems need to be insensitive to diierences between training and test acoustic conditions. Diierences in the acoustic environment may result from various sources, such as ambient background noise, channel variations and speaker stress. These diierences can dramatically degrade the performance of a speech recognition system. A wide range of techniqu...
متن کاملImproved model parameter compensation methods for noise-robust speech recognition
In this paper we study model parameter compensation methods for noise-robust speech recognition based on CDHMM. First, we propose a modified PMC method where adjustment term in the model parameter adaptation is varied depending on mixture components of HMM to obtain more reliable modeling. A statedependent association factor that controls the average parameter variability of Gaussian mixtures a...
متن کاملAdditive and convolutional noises compensation for speaker recognition
It is well known that the performances of speaker identification systems degrade rapidly as the mismatch between training and test conditions increases. In this work we present a noise compensation technique whose goal is to minimize the effects of such mismatch, so as to obtain an identification accuracy as close as possible to that obtained under matched conditions. To reduce this mismatch, t...
متن کاملRobust Speaker Recognition Using MAP Estimation of Additive Noise in i-vectors Space
In the last few years, the use of i-vectors along with a generative back-end has become the new standard in speaker recognition. An i-vector is a compact representation of a speaker utterance extracted from a low dimensional total variability subspace. Although current speaker recognition systems achieve very good results in clean training and test conditions, the performance degrades considera...
متن کاملRobust speaker recognition based on high order cumulant
LP-derived cepstral coefficients are sensitive to additive noise in speech signal. In this paper, an approach to extracting speech feature based on the high-order cumulant is proposed to depress the effect of additive noise in speech signal. The performance of this approach is evaluated using a text-prompt speaker verification system. Experimental results show that this approach is effective to...
متن کامل